CDS
Accession Number | TCMCG019C42715 |
gbkey | CDS |
Protein Id | XP_022932100.1 |
Location | join(843910..844176,844550..844722,844888..844960,845240..845374,845492..845702,846708..846856,848157..848255,848486..848564,848675..848804,850481..850575,850660..850768,851442..851607,851708..851869,855751..855910,856586..856755,856935..856979) |
Gene | LOC111438418 |
GeneID | 111438418 |
Organism | Cucurbita moschata |
Protein
Length | 740aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA418582 |
db_source | XM_023076332.1 |
Definition | DNA mismatch repair protein MLH1 isoform X1 [Cucurbita moschata] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGGAACCCCACGCGGATGACGAGATTATTCCCATGGACACGGGCGGGGAAGAAGAAGTTCCTCCTCAAGAACCCCCCAAAATCCTCCGACTCGACAACTCCGTCGTCAATCGTATCGCTGCCGGAGAGGTCATTCAAAGGCCAGTGTCCGCCATTAAAGAACTCGTCGAAAACAGCCTCGACGCCCAATCTACCTCCGTTAACGTCGTTGTCAAAGACGGCGGTCTCAAACTCATCCAAGTTTCTGACGACGGCCACGGCATCCGTTATGAAGATTTGCCGATTTTGTGCGAGAGGCACACGACGTCCAAGTTGTCAAAATTTGAGGATTTACAGTCCATAAGGTCGATGGGATTTCGAGGAGAGGCGCTAGCGAGCATGACCTATGTAGGTCATGTTACGGTCACCACCATTACTAAAGGACAACTTCACGGTTACAGAGTATCCTATAGAGATGGAGTGATGGAGCATGAGCCCAAGCCATGTGCTGCTGTAAAAGGAACTCAAATAACGGTTGAGAATCTGTTCTATAATATGAGTGCTAGGAGGAAGACACTACAAAATGTGTCCGATGATTACACGAAGATTGTGGATCTCCTAAGTCGATTTGCCATTCATCATATAAACATCAGCTTTTCTTGCAGAAAGCATGGAGCTGCTAGGGCAGACGTTCACTCAGTTGGGTCAACTTCAAGGTTGGATGCCATTCGTACAGTTTACGGTGCATCAGTTGCTCGCAATCTAATGAAAATAGAAGTTTCAGAAAATGATAAAGCCTGTTCAGATTTCAAAATGGATGGTCTAATCTCCAACTCAAATTATACTGCGAAGAAGATCACAATGGTGCTCTTTATTAATGAAAGAATGGTAGACTGTAGTGCTTTAAAAAGAGCTATTGAAATTGTTTATGCTGCAACCTTGCCCAAAGCATCCAAACCTTTCATATATATGTCAATTATATTGCCACCTGAGCATGTTGATGTGAATGTTCATCCAACCAAAAAAGAGGTAAGCCTCCTGAACCAGGAAGTTATTATTGAGAGGATACAGTCAGCTGTGGAATCAAAATTGAGAAGTTCTAATGACACGAGGACATTTCAAGAACAGGATGTAGAATCTTCTGCGGCTAGTCAAATGGTTATTAGAAGTGACTATACTCAGAATTCCTCGCAGTCTGGTACAGCAGGATCAAAGTCACAGAAGGTTCCAGTGCAAAAAATGGTTAGGACAGATTCAACAGATCCAGCTGGAAGGTTGCACGCATATGTGCAAATGAATCCTCCTGGCCTCCCTGAATCTAGCTTGAATACTGTGAGGTCTTTTGTTAGAATGAGAAGGAATCCAAGGGAAGCTGCTAATCTTACTAGCGTTCAAGATCTTGTTGCAGAAATTGATCAGAATTGTCATGCTGGTCTCCTTAACACTGTAAGACATTGTGTATATATTGGAATGGCAGATGACGTCTTCGCTTTGCTTCAGCATGATACTCATCTTTATCTAGCCAATGTTGTGAACTTGAGCAAAGAACTCATGTATCAGCAAGTGTTATGTCGATTTGCACATTTTAATGCAATACAATTGAGCAACCCAGCCCCTCTGTACGAGTTAATTAGTTTGGCACTGAGGGAGGAAGATGTGAATTCAGAGTCTAATGAGAATGATGATTTTAATAAGAAGGTAGCTGAGACGAGTACAAAACTGCTCAAGTTGAAAGCTGAAATGCTCGAGGAATTTTTCTGCATACATATTGACGTAAATGGAAATTTGGCGAGACTTCCAGTCGTACTTGACCAATACACACCTGATATGGACCGTGTTCCTGAATTTGTACTTTCCTTGGCTAATGATATTGATTGGGAAGATGAGAAAAATTGTATCCAGTCGATTTCAGCTGCCATTGGGAACTTCTATGCCATGCATCCTCCCTTGCTGCCAAATCCATCGGGTGATGGCTTGCAGTTCTACAAAAGGATAAAATCATCCGGGAATCCTGAAGATGAAAATATAGGTGCAGATGATACGATGACAATGGAGAATGAAATCAACCACGGTCTACTATCGGAGGCAGAAACCATATGGGCTCAACGTGAATGGTCAATACAGCATGTACTCATCCCATCAATGAAACTTTTCTTCAAGCCTCCACATTCTCTCGCTGAAAATGGATCTTTCATTCGGGTTGCATCATTAGAGAGACTTTACAAGATCTTTGAGAGATGTTGA |
Protein: MEPHADDEIIPMDTGGEEEVPPQEPPKILRLDNSVVNRIAAGEVIQRPVSAIKELVENSLDAQSTSVNVVVKDGGLKLIQVSDDGHGIRYEDLPILCERHTTSKLSKFEDLQSIRSMGFRGEALASMTYVGHVTVTTITKGQLHGYRVSYRDGVMEHEPKPCAAVKGTQITVENLFYNMSARRKTLQNVSDDYTKIVDLLSRFAIHHINISFSCRKHGAARADVHSVGSTSRLDAIRTVYGASVARNLMKIEVSENDKACSDFKMDGLISNSNYTAKKITMVLFINERMVDCSALKRAIEIVYAATLPKASKPFIYMSIILPPEHVDVNVHPTKKEVSLLNQEVIIERIQSAVESKLRSSNDTRTFQEQDVESSAASQMVIRSDYTQNSSQSGTAGSKSQKVPVQKMVRTDSTDPAGRLHAYVQMNPPGLPESSLNTVRSFVRMRRNPREAANLTSVQDLVAEIDQNCHAGLLNTVRHCVYIGMADDVFALLQHDTHLYLANVVNLSKELMYQQVLCRFAHFNAIQLSNPAPLYELISLALREEDVNSESNENDDFNKKVAETSTKLLKLKAEMLEEFFCIHIDVNGNLARLPVVLDQYTPDMDRVPEFVLSLANDIDWEDEKNCIQSISAAIGNFYAMHPPLLPNPSGDGLQFYKRIKSSGNPEDENIGADDTMTMENEINHGLLSEAETIWAQREWSIQHVLIPSMKLFFKPPHSLAENGSFIRVASLERLYKIFERC |